Abstract. We present a generalization of standard Turing machines based
on allowing unusual tapes. We present a set of reasonable constraints on tape
geometry and classify all tapes conforming to these constraints. Surprisingly,
this generalization does not lead to yet another equivalent formulation of the
notion of computable function. Rather, it gives an alternative definition of
the recursively enumerable Turing degrees that does not rely on oracles. The
definitions give rise to a number of questions about computable paths inside
Cayley graphs of finitely generated groups, and several of these questions are
answered.

1. Introduction

When Alan Turing originally defined his a-machines, which would later be called
Turing machines, he envisioned a machine whose memory was laid out along a
one-dimensional tape, inspired by the ticker tapes of the day. This seemed somewhat
arbitrary and perhaps unduly restrictive, and so, very quickly, machines with
multiple and multi-dimensional tapes were proposed. The focus at the time was defining
the term "computable", and as adding tapes and dimensions defined the same class
of functions as Turing's original, simpler model, studying alternate tape geometries
fell out of favor for some time.

The complexity theory community then reignited interest in alternative tape
geometries by considering not the class of functions computable by Turing machines,
but time and space complexity of functions on different tape geometries. This led
to a number of results about relative efficiency of machines with one/many tapes
and one/two/high-dimensional tapes. For example, the language of palindromes
(those strings that read the same forward and backward) can be computed in $O(n)$
time on a two-tape machine, but requires $\Omega(n^2)$ on a one-tape machine with one
read/write head. Or, an m-dimensional Turing machine running in time $T(n)$ can
be simulated by a k-dimensional Turing machine ($k < m$) in time $T(n)^{1+1/k-1/m+\varepsilon}$
for all $\varepsilon > 0$ [7].

Many of the proofs and algorithms used in the study of multidimensional Tur- 
ing machines make their way into or are inspired by the world of mesh-connected 
systems. Mesh-connected systems are arrays of identical, relatively dumb proces- 
sors (typically with memory logarithmic in the input size) that communicate with 
their neighbors to perform a computation. Time use on a Turing machine with a 
d-dimensional tape is intimately tied to the power use of a d-dimensional mesh of 
processors with finite memory, so there is some natural crossover. Mesh-connected 
systems constitute an area of very active research now, but since it remains very 
closely tied to the physical implementation, research is generally restricted to two- 
and three-dimensional grids. 
In this paper, we go beyond the world of rectangular grids and consider tapes at
their most general. The purpose of this is two-fold. First, to give the complexity 
theorists a general framework in which to work, subsuming all current tape-based 
Turing models. Second, to provide some evidence that alternative tape geometries 
are interesting from a recursion-theoretic perspective. Along the way, we will en- 
counter some questions in combinatorial group theory that aren't directly related 
to generalized Turing tapes, but are interesting in their own right. 

Any Turing tape can be modeled as a digraph with nodes corresponding to 
tape cells and edges corresponding to allowable transitions. Hence, we start by 
introducing a number of graph-theoretic conditions that ought to be satisfied by 
a reasonable tape. It turns out that the criteria we outline are necessary and 
sufficient conditions for the graph to be the Cayley graph of a finitely generated, 
infinite group. 

We then turn to the question of whether allowing arbitrary Cayley graphs as 
tapes is just another equivalent machine model for the class of computable functions. 
Interestingly, this will depend on the structure of the group from which the Cayley 
graph is generated. For groups with solvable word problem, this does indeed lead to
machines that compute the class of computable functions. For groups with
unsolvable word problems, however, these machines are strictly more powerful than standard
Turing machines. In fact, they can be as powerful as any oracle machine and we 
end up with an alternative definition of the Turing degrees that is machine based 
and doesn't rely on oracles. 

The constructions and proofs of these results begin to raise questions about 
what kind of computable objects we can hope to find in arbitrary Cayley graphs. In
particular, we ask whether we can always find an infinite, computable, non-self-intersecting
path. We call such a path an escape and we construct a group without any escapes.
This construction is non-trivial as any group without an escape must also be a 
Burnside group, which we also prove. 

Throughout the rest of this piece, all Turing machines will have a single tape and 
a single head. This is the easiest case to treat and the generalization to multiple 
tapes and multiple heads is entirely analogous to the standard Turing picture. 

2. General Tape Geometries 

Any tape essentially consists of a collection of cells, each of which can hold a 
single symbol, together with a mechanism for moving from one cell to another. 
So underlying any tape is an edge-colored digraph. Edges in this graph represent 
allowable transitions between states and the edge coloring encodes the conditions 
on the stored symbol and control state under which that particular transition oc- 
curs. In order to be a reasonable Turing tape, this digraph should satisfy a few 
restrictions. 

(1) Uniqueness of outgoing colors - From any vertex, there should be exactly 
one outgoing edge of each color. Since the mechanism by which the tape 
head moves is encoded in the edge color, outgoing edges should have dif- 
ferent colors. Also, since the transition function is independent of the tape 
cell, the collection of colors going out from each vertex should be the same. 

(2) Homogeneity - Every vertex should "look like" every other vertex. Tech- 
nically, this means that the subgroup of the automorphism group of the 
graph that preserves edge colors should be vertex transitive. This is an 
extension of the assumption that all tape cells are indistinguishable, in this
case by the local geometry. 

(3) Infinity - The tape should have an infinite number of cells. Otherwise, it's 
just a finite automaton. 

(4) Connectedness - As the head moves during the course of a computation, it 
remains on a single connected component. Inaccessible states are useless, 
so we can require that every tape cell be accessible from the starting point. 
In particular, this means that the graph is connected. 

(5) Finitely many colors - The transition function of the TM should be finite, 
so there should only be finitely many outgoing colors. Having more colors 
doesn't change the computational power, since only finitely many of them 
could be referenced by the transition function anyway. 

(6) Backtracking - The Turing machine should be able to return to the cell it 
just came from. This assumption is less essential, since in view of homo- 
geneity, any algorithm that called for returning to the previous tape cell 
could be replaced by a fixed sequence of steps. However, many algorithms 
call for the head to return to the previous cell and forcing the head to do so 
by a circuitous route seems unduly harsh. Note that in view of homogeneity, 
the color of an edge determines the color of the reverse edge. 

Restrictions 1 and 2 imply that our tape is a Cayley graph, restrictions 3, 4, and 5 
make it the Cayley graph of a finitely generated infinite group, and restriction 6 
forces the generating set to be closed under inverses. In addition, any Cayley graph 
of a finitely generated infinite group with a generating set that is closed under
inverses satisfies 1-6. This suggests that Cayley graphs are, in some sense, the 
"right" degree of generality and leads to the following definition: 

Definition 2.1. Let G be an infinite group and $S \subseteq G$ be a finite generating set
for G that is closed under inverses. Then the Cayley graph of G generated by S is
called the tape graph, (G, S).
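
To make Definition 2.1 concrete, here is a minimal Python sketch of a tape graph. The class name `TapeGraph`, the helper `z2`, and the encoding of group elements as tuples are assumptions made for this illustration only; nothing in the text prescribes them.

```python
from dataclasses import dataclass
from typing import Callable, Hashable, Tuple

Element = Hashable


@dataclass(frozen=True)
class TapeGraph:
    """Hypothetical model of a tape graph (G, S): cells are group elements,
    and each generator s gives an edge g -> g*s coloured by s."""
    identity: Element
    generators: Tuple[Element, ...]
    multiply: Callable[[Element, Element], Element]
    inverse: Callable[[Element], Element]

    def closed_under_inverses(self) -> bool:
        # Restriction 6 (backtracking): the inverse of a generator is a generator.
        return all(self.inverse(s) in self.generators for s in self.generators)

    def neighbours(self, g: Element) -> dict:
        # Restriction 1: exactly one outgoing edge of each colour.
        return {s: self.multiply(g, s) for s in self.generators}


def z2() -> TapeGraph:
    """The grid Z^2 with its four unit steps as generators (assumed example)."""
    return TapeGraph(
        identity=(0, 0),
        generators=((0, -1), (-1, 0), (0, 1), (1, 0)),
        multiply=lambda a, b: (a[0] + b[0], a[1] + b[1]),
        inverse=lambda a: (-a[0], -a[1]),
    )
```

Calling `z2().neighbours((2, 3))` lists the four cells adjacent to (2, 3), one per generator. Infinity and connectedness are properties of the group itself and are not checked by this sketch.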

Using this general type of tape, we can then ask questions about the structure 
of Turing Machines with tapes given by assorted groups and generating sets. 

3. Turing Machines on Cayley Graphs 

Definition 3.1. Let $G = \langle g_1, \ldots, g_n \rangle$ be a finitely generated group with the set
$\{g_1, \ldots, g_n\}$ closed under inverses. Then a Turing Machine over G with generating
set $\{g_1, \ldots, g_n\}$ is a 7-tuple, $(Q, \Gamma, b, \Sigma, \delta, q_0, F)$ where

• $Q$ is the finite set of states

• $\Gamma$ is the finite set of tape symbols

• $b \in \Gamma$ is a designated blank symbol

• $\Sigma \subseteq \Gamma \setminus \{b\}$ is the set of input symbols

• $\delta : Q \times \Gamma \to Q \times \Gamma \times \{g_1, \ldots, g_n\}$ is the transition function

• $q_0 \in Q$ is the initial state

• $F \subseteq Q$ is the set of terminal states (typically one to accept and one to
reject)

This definition varies from the standard definition only in the interpretation 
of the transition function. Whereas a standard TM has a two-way infinite one- 
dimensional tape and the transition function includes instructions for moving left 
or right, a TM over G has as a tape the Cayley graph of G and the transition
function has instructions for moving along edges labeled by a particular generator. For
example, a Turing Machine over $\mathbb{Z}$ with generating set $\{-1, +1\}$ is a standard
one-dimensional TM and a TM over $\mathbb{Z}^2$ with generating set $\{(0,-1), (-1,0), (0,1), (1,0)\}$
is a standard two-dimensional TM.
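
The only change from a standard machine is how a move is interpreted, which the following Python sketch tries to show. The function `run_machine`, the dictionary encoding of the transition function, and the three-step demo machine are all assumptions introduced for illustration; the tape is stored as a dictionary keyed by group elements.

```python
def run_machine(delta, start, finals, blank, multiply, identity, max_steps=1000):
    """Run a Turing machine over a group (hypothetical helper).

    delta maps (state, symbol) to (new_state, written_symbol, generator).
    Cells are group elements; cells never written hold `blank`.
    """
    tape = {}
    head, state = identity, start
    for _ in range(max_steps):
        if state in finals:
            break
        symbol = tape.get(head, blank)
        state, written, generator = delta[(state, symbol)]
        tape[head] = written
        head = multiply(head, generator)  # follow the edge labelled `generator`
    return state, tape, head


def add(a, b):
    """Group operation of Z^2, written additively."""
    return (a[0] + b[0], a[1] + b[1])


# A three-step demo machine over Z^2: write "x", then step right, up, left.
DEMO_DELTA = {
    ("a", "_"): ("b", "x", (1, 0)),
    ("b", "_"): ("c", "x", (0, 1)),
    ("c", "_"): ("halt", "x", (-1, 0)),
}

final_state, final_tape, final_head = run_machine(
    DEMO_DELTA, "a", {"halt"}, "_", add, (0, 0))
```

Replacing `add` and the generators by those of another group changes the tape geometry without touching the control loop.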

We have intentionally skipped the notion of how to provide input for machines of 
this type. Most generally, we could insist that the initial set of non-blank tape cells 
be connected and contain the initial location of the tape head. However, we really 
intend these machines to work like Turing machines, and therefore, to compute on 
strings of symbols. It turns out that there will be a canonical way to lay out strings 
on the Cayley graph, but we need some machinery first. 

3.1. A Well-ordering in Trees. We shall turn aside from the main topic for 
a moment to discuss a general statement about trees. There are many ways to 
define an order on the vertices of a tree, but we are going to be interested in 
the lexicographic order. In general, lexicographic orderings on trees have few nice 
properties, but we show that finitely branching trees have a subtree where the 
lexicographic order is in fact, a well-order. 

First, some definitions. Let T be a finitely branching tree. Denote by [T] the set
of all infinite paths through T and by $\sqsubseteq$ the partial ordering on vertices induced
by the tree. Our convention will be that $v \sqsubseteq w$ means that v is closer to the root
than w.

In order for the lexicographic ordering on T to even make sense, we must have
a linear order on the set of children of each node. Denote the order on the children
of $v \in T$ by $<_v$. Then the lexicographic order, $<$, is defined as follows:

• If $v \sqsubseteq w$, then $v < w$.

• If neither $v \sqsubseteq w$ nor $w \sqsubseteq v$, let u be the greatest lower bound of v and
w according to $\sqsubseteq$ and let $u' \sqsubseteq v$ and $u'' \sqsubseteq w$ be children of u. Then
$v < w \iff u' <_u u''$.
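
The two clauses of this definition can be sketched in Python. The encoding is an assumption: a vertex is the tuple of child ranks along the path from the root, with ranks chosen to respect the order on the children of each node. The function name `lex_less` is likewise hypothetical.

```python
def lex_less(v, w):
    """Strict lexicographic order on tree vertices.

    v and w are tuples of child ranks; the root is the empty tuple.
    """
    if v == w:
        return False
    # First clause: a proper ancestor precedes its descendants.
    if w[:len(v)] == v:
        return True
    if v[:len(w)] == w:
        return False
    # Second clause: find the greatest lower bound u = v[:i] = w[:i] and
    # compare the children of u that lead toward v and toward w.
    i = 0
    while v[i] == w[i]:
        i += 1
    return v[i] < w[i]
```

With this encoding the order agrees with Python's built-in comparison of tuples, so `lex_less(v, w)` and `v < w` return the same result.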

This order can, in fact, be extended to an order on T U [T]. Identifying elements 
of [T] with subsets of T and elements of T with one-element subsets of T, we can 
define 

$x < y \iff (\exists w \in y)(\forall v \in x)\; v < w$
Defined in this way, < is a linear order, but we can't really hope for any more 
structure than that. But, as promised, with a bit of pruning, we can find a subtree 
with much more structure. 

Theorem 3.2. Let T be a finitely branching tree and let < be the lexicographic 
ordering on the nodes of and paths through T as given above. Define 

$T' = \{v \in T \mid \forall w \in [T],\ v < w\}$

Then $<$ restricted to T' is order isomorphic to an initial segment of $\omega$. In addition,
if T is infinite, T' is infinite as well. 

Proof. This follows from the following direct consequence of König's Lemma.

Lemma 3.3. Every element of T' has only finitely many $<$-predecessors.

Proof. Suppose $v \in T'$ had infinitely many $<$-predecessors. Then we could form
the tree,

$S = \{w \in T' \mid w < v\}$

This is, in fact, a tree since T' is a tree and $x \sqsubseteq y$ implies $x < y$. Since v has
infinitely many $<$-predecessors, S is infinite.

By König's Lemma, there must be a path through S, call it P. But T' is a
subtree of T so $P \in [T]$. By definition of $<$, $P < v$, but $v \in T'$ so $v < P$. This is a
contradiction, so v must have only finitely many predecessors. □

Any linear order in which every element has only finitely many predecessors
clearly cannot have an infinite descending chain, so must be a well-order. As $\omega + 1$
has an element with infinitely many predecessors, the order type must be an initial
segment of $\omega$.

For the second part of Theorem 3.2, we need an additional lemma.

Lemma 3.4. If [T] is non-empty, then [T] has a minimal element in the $<$ ordering.

Proof. We can inductively construct the minimal element of [T]. For any $v \in T$,
define $s(v)$ to be the minimal (according to $<_v$) child of v that is a member of some
element of [T] if such a vertex exists. Note that if there is a path through v, $s(v)$ is
defined and there is a path through $s(v)$. If $v_0$ is the root, then $P = \{s^{(n)}(v_0)\}_{n \in \mathbb{N}}$
is the desired minimal element.

Since [T] is non-empty, there is a path through the root and so, by induction
$s^{(n)}(v_0)$ is defined for all n. Therefore P is indeed a path.

To see that P is minimal, let $P \neq P' \in [T]$. Let $v = s^{(n)}(v_0)$ be the largest
element (according to $\sqsubseteq$) of $P \cap P'$ and let $w \in P'$ be a child of v. By maximality
of v, $w \neq s(v)$ and by construction, $s(v) \le_v w$. Therefore, $s(v) <_v w$. By the
lexicographic ordering, w is greater than all descendants of $s(v)$ and also greater
than all ancestors of $s(v)$ (since ancestors of $s(v)$ are also ancestors of w). Therefore,
$w > u$ for all $u \in P$ and $P' > P$. □

Now, let T be infinite. Then, by König's Lemma again, [T] is non-empty. Let
P be the minimal path in [T] according to Lemma 3.4. Then $P \subseteq T'$ since for any
$w \in P$ and $P' \in [T]$, $w < P \le P'$. P is infinite, so T' is infinite as well. □
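
The construction of the minimal path in Lemma 3.4 can be sketched in Python as repeated application of s. The names `minimal_path`, `children`, and `extendable` are assumptions for this illustration. In particular, `extendable(v)` must decide whether v lies on some infinite path, which in general is not computable; the sketch simply takes it as given.

```python
from itertools import islice


def minimal_path(children, extendable, root):
    """Yield root, s(root), s(s(root)), ... as in Lemma 3.4.

    children(v) lists the children of v in increasing order; extendable(v)
    is an assumed oracle telling whether v lies on an infinite path.
    The root itself is assumed to be extendable.
    """
    v = root
    while True:
        yield v
        # s(v): the least child of v that still lies on an infinite path.
        v = next(c for c in children(v) if extendable(c))


# Toy tree (an assumption): vertices are 0/1 tuples, and a vertex ending
# in 0 is a leaf, so the only infinite path is (1, 1, 1, ...).
def toy_children(v):
    return [] if v and v[-1] == 0 else [v + (0,), v + (1,)]


def toy_extendable(v):
    return 0 not in v


prefix = list(islice(minimal_path(toy_children, toy_extendable, ()), 4))
```

In the toy tree every 0-child is lexicographically smaller than its sibling but is a dead end, so s always picks the 1-child.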

3.2. Power of Turing Machines on Cayley Graphs. One of the first questions 
to be asked about any new model of computation is whether the class of functions 
computable by the new model is different from the class of computable functions. 
For Turing machines on Cayley graphs, this depends rather sensitively on properties 
of the group producing the tape graph. For example. 

Lemma 3.5. Let (G, S) be a tape graph. There is a Turing machine over (G, S) 
that can solve the word problem for G. 

This will not be proved rigorously, since we have not yet defined how input is 
to be provided, but we will provide an argument that can be made rigorous in an 
obvious fashion by the end of this section. 

Given two sequences of generators, the machine can simply follow the first se- 
quence of generators, leaving a pointer at each cell along the way pointing to its 
predecessor. Marking the end, it can follow the sequence of pointers back to the ori- 
gin. Now, it can follow the second sequence of generators and check to see whether 
the end point was marked in the first step. Clearly, if this algorithm ends on the 
marked cell, the two sequences of generators correspond to the same group element. 
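
The marking argument can be sketched in Python as follows. The function `same_element` and its parameters are assumptions for illustration; the dictionary `back` and the set `marked` stand in for symbols written on tape cells, and keying them by group element is exactly where the geometry of the Cayley graph does the work. One detail is added here: only the first pointer written on a cell is kept, so that the walk back to the origin terminates even if the first path crosses itself.

```python
def same_element(u, v, multiply, inverse, identity):
    """Decide whether generator words u and v name the same group element,
    using only local moves and marks on cells."""
    back = {identity: None}  # cell -> generator leading one step toward the origin
    head = identity
    for s in u:  # phase 1: follow u, leaving back-pointers
        head = multiply(head, s)
        back.setdefault(head, inverse(s))  # keep only the first pointer
    marked = {head}  # mark the end point of u
    while back[head] is not None:  # phase 2: follow pointers to the origin
        head = multiply(head, back[head])
    for s in v:  # phase 3: follow v from the origin
        head = multiply(head, s)
    return head in marked


# Example group (an assumption): Z^2 with unit-step generators.
def add(a, b):
    return (a[0] + b[0], a[1] + b[1])


def neg(a):
    return (-a[0], -a[1])


R, L, U, D = (1, 0), (-1, 0), (0, 1), (0, -1)
```

For instance, `same_element([R, U], [U, R], add, neg, (0, 0))` compares the two ways around a unit square.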

Boone and Novikov [1] [8] independently showed that there exist groups with 
undecidable word problems, so this leads us to believe that Turing machines on a 
given group with unsolvable word problem are strictly more powerful than standard
Turing machines. However, this requires that Turing machines over said group also 
be able to compute all computable functions. Fortunately, this is the case. 

Theorem 3.6. Let

$M = (Q_M, \Gamma_M, b_M, \Sigma_M, \delta_M, q_{0M}, F_M)$

be a standard one-dimensional one-tape Turing Machine and let (G, S) be a tape
graph. Then there is a Turing Machine over (G, S) that can simulate M.

The simulation itself is very straight-forward. The only difficulty stems from 
the question of how to arrange the tape contents of the simulated machine on the 
Cayley graph. If we could compute an infinite non-self-intersecting path through 
the Cayley graph, we could use this as a standard one-dimensional tape and do the 
simulation there. However, as we will see in Section 4, such a path need not exist.

Fortunately, we can do the simulation anyway, in this case, by a variant of
the "always turn left" algorithm for solving mazes. By putting an ordering on
the generators of our tape graph, "always turn left" becomes "always follow the
lexicographically minimal edge". So, in an appropriate tree, Section 3.1 gives us
a well-ordered subtree where we can do the simulation with the nth vertex in the
well-ordering storing the contents of the nth tape cell.



3.3. Proof of Theorem 3.6. We will start with a description of the tree where
we will store the tape of the simulated machine. After we describe the operation of 
the machine, we will prove that this tree can be constructed on-line and navigated 
effectively. 

We will routinely use the natural correspondence between sequences of generators 
and group elements given by forming the product of the generators in the given 
sequence and evaluating in the group. Henceforth, sequences and group elements 
will be used interchangeably. Of course, multiple sequences will correspond to the 
same group element, but the sequence should always be clear from context. 

Definition 3.7. Let G be a group. A super-reduced word is a finite sequence
of elements of G such that no subword, taken as a product in G, is equal to the
identity. More precisely, it is a sequence, $g_1, \ldots, g_n$, such that for all $1 \le i \le j \le n$,
$\prod_{k=i}^{j} g_k \neq e$ in G.

In the context of tape graphs, super-reduced words with symbols from the gener- 
ating set correspond exactly to non-intersecting finite paths through the tape graph. 
Note that a word is super-reduced if and only if all of its prefixes are super-reduced. 
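
The correspondence with non-intersecting paths gives a simple test, sketched here in Python. A subword $g_i \cdots g_j$ multiplies to the identity exactly when the prefix products before $g_i$ and after $g_j$ coincide, so it suffices to check that the walk never revisits a cell. The function name `is_super_reduced` and the example group $\mathbb{Z}^2$ are assumptions for illustration.

```python
def is_super_reduced(word, multiply, identity):
    """True if no non-empty contiguous subword multiplies to the identity,
    i.e. the walk spelled by `word` never revisits a cell."""
    visited = {identity}
    position = identity
    for s in word:
        position = multiply(position, s)
        if position in visited:  # two prefix products agree
            return False
        visited.add(position)
    return True


# Example group (an assumption): Z^2 with unit-step generators.
def add(a, b):
    return (a[0] + b[0], a[1] + b[1])


R, L, U, D = (1, 0), (-1, 0), (0, 1), (0, -1)
```

The loop also makes the remark about prefixes visible: the check fails at the first prefix that is not super-reduced.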

Form the tree, T, of super-reduced words with symbols from S. Since every 
group element has at least one super-reduced word corresponding to it and G is 
infinite, T is a spanning tree for the Cayley graph of G, hence, infinite. Therefore, 
we can construct an infinite T' as in Section 3.1 where the lexicographic ordering
is a well-ordering. We will not do the computation on T' , but on a subtree, which 
we will call R. To define R we will want another definition. 

Definition 3.8. We say that two sequences of generators, v and w, are equivalent
in G, or $v =_G w$, if they correspond to the same group element. In other words,
$v =_G w$ if $v = (s_1, \ldots, s_n)$, $w = (r_1, \ldots, r_m)$ and

$\prod_{i=1}^{n} s_i = \prod_{j=1}^{m} r_j$ in G.

Now we can define R as follows,

$R = \{v \in T' \mid (\forall w \in T')\; v =_G w \implies v \le w\}$

That is, R is the set of vertices in T' that are lexicographically minimal among
sequences that represent the same group element.

It's not obvious at first glance, but R is a tree. Suppose to the contrary that
$uv = w \in R$ but $u \notin R$. Then there is some $u' < u$ in R corresponding to the
same group element. Since we began with the tree of super-reduced words, u and
u' are incomparable in the tree order. Therefore, u and u' must differ at some first
location. So, $u'v =_G w$ but uv and u'v differ for the first time at the same location
and since $u' < u$, $u'v < uv = w$. This is a contradiction since w was supposed to
be minimal among words corresponding to the same group element, so R is indeed
a tree.

Also non-obvious is the fact that R is infinite. By the proof of Theorem 3.2,
T' contains a path. As was noted earlier, since we began with the tree of super- 
reduced words, elements that are comparable in the tree order cannot correspond 
to the same group element. Therefore, the path in T' corresponds to an infinite 
collection of group elements. R represents the same group elements since we only 
pruned redundant representations, so R also represents an infinite number of group 
elements and is therefore infinite. 

Now for any tape graph, we have a tree that is well-ordered lexicographically, 
represents infinitely many group elements and represents each individual element 
at most once. This is where we will do the simulation. 

Let (G, S) be the tape graph given in the statement of the theorem. Denote the
elements of S by $g_1, \ldots, g_n$. It will be convenient to define a set $S' = S \cup \{g_0, g_{n+1}\}$
with the ordering $g_0 < g_1 < \cdots < g_n < g_{n+1}$. We will consider both $g_0$ and $g_{n+1}$
equal to e in the group for the purposes of moving from node to node. Define a
Turing Machine, N, over (G, S) as follows:

• $Q_N = Q_M \cup (Q_M \times S') \cup (Q_M \times S') \cup (Q_M \times S) \cup (Q_M \times S)$

• $\Gamma_N = \Gamma_M \times S' \times \mathcal{P}(S) \times \mathcal{P}(S)$

• $b_N = (b_M, g_0, \emptyset, \emptyset)$

• $\Sigma_N = \Sigma_M \times S' \times \mathcal{P}(S) \times \mathcal{P}(S)$

• $q_{0N} = q_{0M}$

• $F_N = F_M \times S' \times \mathcal{P}(S) \times \mathcal{P}(S)$

Each symbol in the tape alphabet will have an intended meaning. Remember that
we are going to do the computation on a tree, so we have to encode in each node
not just the symbol of the simulated machine, but also auxiliary information about
the structure of the tree. In particular, if $(\gamma, \sigma, A, B) \in \Gamma_N$, $\gamma$ is the symbol of
the simulated machine stored at the node, $\sigma$ is the generator to follow to reach the
ancestor of this node in the tree, A is the set of generators corresponding to edges
pointing away from the root in the tree, and B is the set of generators defining
non-edges of the tree. Remember that we are going to be constructing this tree on the
fly and so we don't have complete information about which generators correspond
to edges of the tree at every step. Thus, elements of $S \setminus (A \cup B)$ are the edges whose
membership in the tree has not yet been determined.

We will also give each state a name and an intended interpretation, 

• $C_q$ for $q \in Q_M$: We are simulating the computation of M and the current
state of M is q.

• $R_{qx}$ for $q \in Q_M$ and $x \in S'$: The simulated tape head is moving to the
right and M is currently in state q. The argument x encodes the edge we
followed to reach our current location.

• $L_{qx}$ for $q \in Q_M$ and $x \in S'$: The simulated tape head is moving to the
left and M is currently in state q. The argument x encodes the edge we
followed to reach our current location.

• $E_{qx}$ for $q \in Q_M$ and $x \in S$: The simulated tape head is moving to the
right and M is in state q, but we have run out of tape and are attempting
to extend the tree along edge x.

• $B_{qx}$ for $q \in Q_M$ and $x \in S$: M is currently in state q and we just failed to
extend the tree along edge x, so we are backtracking.

Note that the starting state of N is named $C_{q_{0M}}$.

It will also be convenient to talk about the component functions of the transition
function of M,

$\delta_1 : Q_M \times \Gamma_M \to Q_M$
$\delta_2 : Q_M \times \Gamma_M \to \Gamma_M$
$\delta_3 : Q_M \times \Gamma_M \to \{L, R\}$


We can now write down the action of the transition function. If the current
symbol being read is $(\gamma, \sigma, A, B)$, then the value of the transition function on each
type of state is given in Table 1. Entries in the table are triples in the order, new
state, symbol written, direction the tape head moves.



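State      $\delta_N(\text{State}, (\gamma, \sigma, A, B))$

$C_q$      $(R_{\delta_1(q,\gamma)\,g_0},\ (\delta_2(q,\gamma), \sigma, A, B),\ g_0)$   if $\delta_3(q,\gamma) = R$
           $(L_{\delta_1(q,\gamma)\,\sigma},\ (\delta_2(q,\gamma), \sigma, A, B),\ \sigma)$   if $\delta_3(q,\gamma) = L$

$L_{qx}$   $(C_q,\ (\gamma, \sigma, A, B),\ g_0)$   if $\forall y \in A,\ y > x$
           $(L_{q\,g_{n+1}},\ (\gamma, \sigma, A, B),\ \max_{y \in A,\, y < x} y)$   otherwise

$R_{qx}$   $(C_q,\ (\gamma, \sigma, A, B),\ \min_{y \in A,\, y > x} y)$   if $\exists y \in A,\ y > x$
           $(E_{qy},\ (\gamma, \sigma, A \cup \{y\}, B),\ y)$ where $y = \min_{z \in S \setminus B,\, z > x} z$   if $\exists z \in S \setminus B,\ z > x$
           $(R_{q\sigma},\ (\gamma, \sigma, A, B),\ \sigma)$   otherwise

$E_{qx}$   $(C_q,\ (\gamma, x^{-1}, \emptyset, \{x^{-1}\}),\ g_0)$   if $A = B = \emptyset$
           $(B_{qx},\ (\gamma, \sigma, A, B),\ x^{-1})$   otherwise

$B_{qx}$   $(R_{qx},\ (\gamma, \sigma, A \setminus \{x\}, B \cup \{x\}),\ g_0)$

Table 1. Transition Table for N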



Most of these transitions are pretty opaque, so some explanation is warranted. 

If we are in a C-state, then we perform one step of the computation and then
transition into either an R-state or an L-state depending on whether the
computation tries to move right or left. A leftward step in the simulated machine always
begins with a step toward the root for the simulating machine, so we take that step
immediately. Rightward steps are more complicated, so we leave the tape head
where it is on a rightward step.



TURING MACHINES ON GRAPHS AND INESCAPABLE GROUPS 9 

Taking a step to the left, we want to find the lexicographic immediate predecessor 
of our current vertex. We begin by taking one step toward the root, which we did 
when we transitioned out of the C-state. Then, either there is a branch all of 
whose elements are less than where we started, or not. If not, then we are at the 
immediate predecessor of our origin, and we continue computing. Otherwise, follow 
the greatest such branch and always move to the greatest child until we reach a 
dead-end. This dead-end is the immediate predecessor we were looking for, so we 
can continue the computation. 

Taking a step to the right is significantly more complicated, as we are most likely
going to have to extend the tree as we go. If we have already built some tree above
us (A is non-empty), then we simply use that part of the tree, moving along the
minimal edge in the tree and continuing the computation. This is the reason for
using the "false" generator $g_0$ when we transition out of the C-state. Otherwise,
try to extend the tree along the least edge that we have not already ruled out. We
add that edge to A and switch to an E-state.

If A and B are both empty, then this is a new vertex that we have not visited
before, so continue with the computation. Otherwise, we are at a vertex that has
already been added elsewhere in the tree. So, we back up and transition to a
B-state. In the B-state, we rule out the edge we just took by removing it from A and
adding it to B and switch to an R-state to try extending the tree again.

If we can't extend the tree at all ($B = \{g_1, \ldots, g_n\}$), then take a step toward the
root and try again, eschewing anything less than or equal to the edge we backtracked
along. Since there is an infinite tree for us to use, we will eventually be able to
extend the tree.

We are now in a position to be more explicit about input. Take a machine with 
one tape given by a tape graph, (G, S) and another, read-only one-way standard 
input tape. Then, in view of the construction above, it is clear how this machine 
would transcribe its input from the standard tape onto the tape graph. Once 
the transcription is done, the computation can proceed according to the above 
construction. Thus, it seems reasonable to require the input to a machine be the 
result of a transcription of this type. This does require some (potentially) non- 
recursive manipulation of a string to produce the appropriate input, but if this 
offends you, it is always possible to go back to the formalism with an auxiliary 
read-only input tape. 

3.4. An Alternative Characterization of the Turing Degrees. We have demon- 
strated that Turing Machines on arbitrary Cayley graphs are strictly more powerful 
than standard Turing Machines, so the next question to ask is, "how much more 
powerful?" The short answer is "as powerful as we want". In [2] [3], Boone showed
that for any r.e. Turing degree, there is a finitely presented group whose word
problem is complete for that degree. Using such a group, we can produce a
machine (more precisely a class of machines) that computes exactly the functions in
or below the given degree. More precisely,

Theorem 3.9. Let T be an r.e. Turing degree. There is a group, G, such that the
class of functions computable by a Turing Machine over G is exactly the class of
functions in or below T.

Proof. By Boone, let G be a group whose word problem is complete for T and let f
be a function in or below T. Since $f \le T$, f is Turing reducible to the word problem
for G. Turing machines over G can perform this reduction since they can compute
all recursive functions and they can solve the word problem for G by Lemma 3.5.
Therefore, they can compute f.

To show that a Turing Machine over G cannot compute a class larger than T , 
observe that a Turing Machine with an oracle for the word problem for G can easily 
simulate a Turing Machine over G. It simply maintains a list of nodes written as 
a sequence of generators for the address together with whichever tape symbol is 
written there. When the simulated tape head moves, the machine simply appends 
the generator to the current address and consults the oracle to determine which 
node this corresponds to, adding a new entry if the new node isn't in the list. □
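
The bookkeeping in the second half of this proof can be sketched in Python. The class `OracleTape` and the callback `equal_in_group`, which plays the role of the oracle for the word problem, are assumptions introduced for illustration. Addresses are stored as tuples of generators.

```python
class OracleTape:
    """Tape of a machine over G, kept as a list of (address, symbol) pairs.

    equal_in_group(u, v) is the assumed word-problem oracle: it reports
    whether two generator sequences name the same group element.
    """

    def __init__(self, equal_in_group, blank):
        self.equal = equal_in_group
        self.blank = blank
        self.cells = [((), blank)]  # the origin has the empty address
        self.current = 0            # index of the cell under the head
        self.address = ()           # generators followed so far

    def read(self):
        return self.cells[self.current][1]

    def write(self, symbol):
        address, _ = self.cells[self.current]
        self.cells[self.current] = (address, symbol)

    def move(self, generator):
        # Append the generator to the current address, then ask the oracle
        # whether the resulting node is already in the list.
        self.address = self.address + (generator,)
        for index, (address, _) in enumerate(self.cells):
            if self.equal(address, self.address):
                self.current = index
                return
        self.cells.append((self.address, self.blank))
        self.current = len(self.cells) - 1
```

For a group such as $\mathbb{Z}$ with generators +1 and -1, the oracle is simply `lambda u, v: sum(u) == sum(v)`; for Boone's groups no such computable test exists, which is why it has to be supplied as an oracle.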

4. Inescapable Groups 

4.1. Construction of an Inescapable Group. Theorem 3.6 raises the question
of whether or not a Turing Machine can always walk off to infinity on the Cayley 
graph of an infinite group without retracing its steps. This suggests the following 
definition: 

Definition 4.1. An inescapable group is a tape graph, (G, S), such that any infinite
computable sequence, s, of elements from S corresponds to a self-intersecting path. 

This definition leads to a number of questions. Do such things exist? What do 
they look like? Is inescapability a group invariant? Very little is known in response. 
For example, as we will see in Section 4.2, an inescapable group must be a Burnside
group; that is, every element must have finite order. The purpose of this section 
is to answer the more fundamental question of whether or not inescapable groups 
exist. 

4.1.1. Definitions. We will need to make a few definitions. Let A be a finite set.
Let $A^* = A^{<\omega}$ be the set of all finite sequences with elements from A, called words
over A. Let $\varepsilon$ denote the empty word and let $A^+ = A^* \setminus \{\varepsilon\}$ be the set of
non-empty words. If $w \in A^*$, denote by $|w|$ the length of w, by $w(i)$ the ith symbol
in w (indexed from 0), and by $w(i,j)$ the word $w(i)w(i+1) \cdots w(j)$. Also, let
$A^{\le n} = \{w \in A^* : |w| \le n\}$ be the set of words of length no greater than n.
For $w, w' \in A^*$ define the following relations:

• w' is a subword of w if there exist $0 \le i, j < |w|$ such that $w' = w(i,j)$.

• w' is a subsequence of w if there exist $0 \le i_1 < i_2 < \cdots < i_{|w'|} < |w|$ such
that $w' = w(i_1)w(i_2) \cdots w(i_{|w'|})$.

• $\#(w,w') = |\{x \in \mathbb{N}^{|w'|} : x_0 < \cdots < x_{|w'|-1} \text{ and } w(x_i) = w'(i) \text{ for all } i\}|$ is
the number of ways in which w' is a subsequence of w.

Notice that $\varepsilon$ is both a subword and subsequence of every word and that $\#(w, \varepsilon) = 1$
for all w. As an example, note that while aabbaa does not contain aba as a subword,
it does contain aba as a subsequence in 8 different ways.
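
The count in this example can be checked with a short Python sketch. The function names `count_subsequence` and `is_subword` are assumptions for illustration, and words are represented as ordinary strings.

```python
def count_subsequence(w, sub):
    """Number of ways in which `sub` occurs in `w` as a subsequence."""
    # ways[j] counts embeddings of sub[:j] into the part of w read so far.
    ways = [1] + [0] * len(sub)
    for ch in w:
        # Go right to left so each symbol of w is used at most once per embedding.
        for j in range(len(sub), 0, -1):
            if sub[j - 1] == ch:
                ways[j] += ways[j - 1]
    return ways[len(sub)]


def is_subword(w, sub):
    """True if `sub` occurs in `w` as a contiguous block."""
    return sub in w
```

Here `count_subsequence("aabbaa", "aba")` evaluates to 8: there are two choices for each of the three symbols. The empty word is counted once, matching the convention above.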

We will say that an infinite sequence of symbols from A, call it s, is computable 
if there is a Turing Machine which, on input n, produces s{n). 

When dealing with sets, we will use $\mathcal{P}(X)$ to denote the collection of all subsets
of X and $\mathcal{P}_r(X)$ to denote all subsets of X of cardinality r.

We will also be dealing with polynomials in a non-commutative polynomial ring, 
so let $f \in K\langle x_1, \ldots, x_d \rangle$ be a polynomial over a field, K, with non-commuting
indeterminates. We say that f is homogeneous of degree n if f is a K-linear
combination of monomials of the form

$x_{i_1}^{n_1} x_{i_2}^{n_2} \cdots x_{i_k}^{n_k}$

with $n_1 + n_2 + \cdots + n_k = n$. In this case, we write $d(f) = n$.

4.1.2. A Combinatorial Lemma. When we get to the actual construction, we are 
going to need to find a subsequence of each computable sequence that satisfies 
certain properties. In particular, we need the following lemma. 

Lemma 4.2. Let $A = \{x_1, \ldots, x_m\}$ and let $n \ge 1$. There is some $C(m,n)$ such
that for all words, $s$, over alphabet $A$ with $|s| \ge C(m,n)$, $s$ contains a non-empty
subword, $w$, with the following property. For each $w' \in A^{\le n} \cap A^+$, $w$ contains $w'$
as a subsequence an even number of times.
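
For very small parameters the lemma can be checked by exhaustive search. The sketch below is only an illustration: it looks for the least length that works, which may be far smaller than any bound obtained from Ramsey's Theorem, and all function names are our own.

```python
from itertools import product


def count_subsequence(w, sub):
    """Number of ways `sub` occurs in `w` as a subsequence."""
    ways = [1] + [0] * len(sub)
    for symbol in w:
        for j in range(len(sub), 0, -1):
            if sub[j - 1] == symbol:
                ways[j] += ways[j - 1]
    return ways[len(sub)]


def has_even_subword(s, alphabet, n):
    """True if some non-empty subword of s contains every non-empty word
    of length <= n as a subsequence an even number of times."""
    patterns = ["".join(p) for k in range(1, n + 1) for p in product(alphabet, repeat=k)]
    for i in range(len(s)):
        for j in range(i + 1, len(s) + 1):
            if all(count_subsequence(s[i:j], p) % 2 == 0 for p in patterns):
                return True
    return False


def least_length(alphabet, n, max_len=12):
    """Smallest L <= max_len such that every word of length L has the property, else None."""
    for length in range(1, max_len + 1):
        if all(has_even_subword("".join(s), alphabet, n)
               for s in product(alphabet, repeat=length)):
            return length
    return None


print(least_length("ab", 1))  # 4
```

For $m = 2$ and $n = 1$ the answer 4 agrees with a pigeonhole argument on the parities of letter counts in prefixes: five prefixes, four parity vectors.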

In fact, the number $C(m,n)$ is a Ramsey number, $R(2, 3, 2^{N})$, where
$N = |A^{\le n} \cap A^+| = \frac{m^{n+1}-1}{m-1} - 1$ (for $m \ge 2$) and the
function $R(r,k,n)$ is given in Ramsey's Theorem.

Theorem 4.3 (Ramsey). Let $r, k, n$ be positive integers with $1 \le r \le k$. Then there
exists an integer, denoted $R(r,k,n)$, such that for each set $X$ with $|X| = R(r,k,n)$
and each partition of $\mathcal{P}_r(X)$, $Y_1, \ldots, Y_n$, there exists a $k$-element subset $Y$ of $X$
and a set $Y_i$ with $\mathcal{P}_r(Y) \subseteq Y_i$.

To prove the lemma, we need a theorem from the combinatorial theory of words. 

Theorem 4.4 (Pirillo [9]). Let $\varphi : A^+ \to E$ be a mapping from $A^+$ to a set $E$ with
$|E| = n$. For each $k \ge 1$, each word $w \in A^+$ of length $R(2, k+1, n)$ contains a
subword $w_1 w_2 \cdots w_k$ with $w_i \in A^+$ and

\[ \varphi(w(i,i')) = \varphi(w(j,j')) \]

for all pairs $(i,i')$, $(j,j')$ with $1 \le i \le i' \le k$ and $1 \le j \le j' \le k$, where $w(i,i')$
here denotes the product $w_i w_{i+1} \cdots w_{i'}$.
Proof of Lemma 4.2. Consider the function $\varphi : A^+ \to \mathbb{Z}_2^{A^{\le n} \cap A^+}$ defined as follows:

\[ (\varphi(w))(w') = \#(w,w') \bmod 2 \]

Since

\[ |\mathbb{Z}_2^{A^{\le n} \cap A^+}| = 2^{|A^{\le n} \cap A^+|} \]

we can apply Theorem 4.4 with $k = 2$ to $s$ to get $w_1 w_2$, a subword of $s$ such that

\[ \varphi(w_1) = \varphi(w_2) = \varphi(w_1 w_2) \]

Then the word $w_1 w_2$ contains any non-empty sequence over $A$ of length no greater
than $n$ as a subsequence an even number of times.

We will prove something slightly stronger by induction on the length of the
contained subsequence. Specifically, we will show that not only does $w_1 w_2$ satisfy
the lemma, but so do $w_1$ and $w_2$ individually.

For the base case, let $x_i$ be a subsequence of length one. Then

\[ \varphi(w_1 w_2)(x_i) = \varphi(w_1)(x_i) + \varphi(w_2)(x_i) = 2\varphi(w_1)(x_i) = 0 \bmod 2 \]

since the number of occurrences of a single symbol in a concatenation of words is
simply the sum of the number of occurrences in each factor. Using the fact that
$\varphi(w_1 w_2) = \varphi(w_1) = \varphi(w_2)$, we have established the base.
The induction step isn't significantly more difficult; the only difficulty arises from
the fact that the number of occurrences of a subsequence of length greater than one
isn't additive. However, we do have the formula,

\[ \#(w_1 w_2, w') = \sum_{i=0}^{|w'|} \#(w_1, w'(0, i-1)) \, \#(w_2, w'(i, |w'|-1)) \]

where $w'(0,-1)$ and $w'(|w'|, |w'|-1)$ are read as the empty word $\epsilon$.
We really only care about the parity of this expression and if we assume the
induction hypothesis for all strings shorter than $w'$, most of the terms are even. So by
reducing,

\[ \#(w_1 w_2, w') = \#(w_1, w') + \#(w_2, w') \bmod 2 \]

Now, in an argument analogous to the base case, we get

\[ \varphi(w_1 w_2) = \varphi(w_1) = \varphi(w_2) = 0 \bmod 2 \]

□
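
The splitting formula used in the induction step can be checked numerically on small words. This is a sketch for illustration only; the helper names are our own, and `sub[:i]` and `sub[i:]` play the roles of $w'(0, i-1)$ and $w'(i, |w'|-1)$.

```python
from itertools import product


def count_subsequence(w, sub):
    """Number of ways `sub` occurs in `w` as a subsequence."""
    ways = [1] + [0] * len(sub)
    for symbol in w:
        for j in range(len(sub), 0, -1):
            if sub[j - 1] == symbol:
                ways[j] += ways[j - 1]
    return ways[len(sub)]


def split_formula(w1, w2, sub):
    """Sum over split points i: the first i symbols embed in w1, the rest in w2."""
    return sum(count_subsequence(w1, sub[:i]) * count_subsequence(w2, sub[i:])
               for i in range(len(sub) + 1))


def check(alphabet="ab", max_word=4, max_sub=3):
    """Compare both sides of the formula for all short words over `alphabet`."""
    words = ["".join(p) for k in range(max_word + 1) for p in product(alphabet, repeat=k)]
    subs = [w for w in words if len(w) <= max_sub]
    return all(count_subsequence(w1 + w2, s) == split_formula(w1, w2, s)
               for w1 in words for w2 in words for s in subs)


print(check())  # True
```

The two end terms of the sum, $i = 0$ and $i = |w'|$, are exactly $\#(w_2, w')$ and $\#(w_1, w')$, which is why only they survive modulo 2 under the induction hypothesis.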

4.1.3. The Golod-Shafarevich Theorem. The Golod-Shafarevich Theorem is a pow- 
erful tool from algebra that gives a sufficient condition for a particular quotient 
algebra to be infinite dimensional. For our purposes, it is a tool that will ensure 
that as we start adding relations to a free group, we don't collapse the group to 
something finite. 

The power of the theorem comes from the fact that the criterion it presents is 
based only on the number of relations of certain types, and not on the relations 
themselves. This gives us a large amount of freedom to choose the relations we 
want without having to worry about bad interactions between them. 

Theorem 4.5 (Golod-Shafarevich [6]). Let $R_d = K\langle x_1, \ldots, x_d \rangle$ be the polynomial
ring over a field, $K$, in the non-commuting indeterminates $x_1, \ldots, x_d$. Let
$F = \{f_1, f_2, \ldots\}$ be a set of homogeneous polynomials of $R_d$, and let the number
of polynomials of degree $i$ be $r_i$. Let $2 \le d(f_i) \le d(f_{i+1})$ and let $I$ be the ideal
generated by $F$. Let $R_d/I = A$. If all the coefficients in the power series,

\[ \left(1 - dt + \sum_{i=2}^{\infty} r_i t^i\right)^{-1} \]

are non-negative, then $A$ is infinite dimensional.
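
The coefficients of the power series in the theorem satisfy a simple recurrence, so the first few can be inspected directly. The sketch below is an illustration under our own naming (`gs_coefficients`, with the $r_i$ passed as a dictionary); checking finitely many coefficients is of course not a proof that all of them are non-negative.

```python
def gs_coefficients(d, r, terms):
    """First `terms` coefficients q_0, q_1, ... of 1 / (1 - d*t + sum_i r[i]*t^i).

    `r` maps a degree i >= 2 to the number r_i of relations of that degree.
    Multiplying the series by its denominator gives, for n >= 1,
        q_n = d*q_{n-1} - sum_{2 <= i <= n} r_i * q_{n-i}.
    """
    q = []
    for n in range(terms):
        if n == 0:
            q.append(1)
            continue
        value = d * q[n - 1]
        for i, r_i in r.items():
            if 2 <= i <= n:
                value -= r_i * q[n - i]
        q.append(value)
    return q


print(gs_coefficients(2, {2: 1}, 6))  # [1, 2, 3, 4, 5, 6]
print(gs_coefficients(2, {2: 2}, 5))  # [1, 2, 2, 0, -4]
```

In the first call the denominator is $(1-t)^2$ and all coefficients are positive; in the second, two relations of degree 2 already produce a negative coefficient.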

In a subsequent paper, Golod [5] proves the following corollary. 
Corollary 4.6. In Theorem 4.5, if

\[ r_i \le \varepsilon^2 (d - 2\varepsilon)^{i-2} \]

where $0 < \varepsilon < \frac{d}{2}$, then $A$ is infinite-dimensional.

For example, taking $d = 2$ and $\varepsilon = \frac{1}{4}$ in the corollary, we see that if $r_i \le 2$ for
all $i \ge 11$ and $r_i = 0$ for all $i < 11$, $A$ is infinite dimensional.
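
The threshold 11 in this example can be confirmed with exact rational arithmetic. This is a small sketch; `golod_bound` is a name of our own for the right-hand side of the corollary.

```python
from fractions import Fraction


def golod_bound(d, eps, i):
    """Right-hand side eps^2 * (d - 2*eps)^(i - 2) of the corollary."""
    return eps ** 2 * (d - 2 * eps) ** (i - 2)


d, eps = 2, Fraction(1, 4)
# the bound grows with i because d - 2*eps = 3/2 > 1
first = next(i for i in range(2, 100) if golod_bound(d, eps, i) >= 2)
print(first)                              # 11
print(float(golod_bound(d, eps, 10)))     # about 1.60
print(float(golod_bound(d, eps, 11)))     # about 2.40
```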

Golod used this fact to establish the existence of a Burnside group, that is, an 
infinite group in which every element has finite order. He did more than this, in 
fact, and produced an infinite p-group for each prime, p. The diagonalization in 
Section 4.1.4 will follow the same general ideas as Golod's construction as simplified
for countable fields in Fischer and Struik [4]. 
4.1.4. The Construction. In Fischer and Struik [4], there is a construction of a
nil-algebra over finite and countable fields. Although not expressly stated there, 
the construction is essentially a diagonalization over all polynomials. In fact, even 
Golod's original construction of a nil-algebra can be viewed as a diagonalization over 
all polynomials, but the fact that the collection of all polynomials in the general 
case is uncountable makes it more difficult to see. 

The presentation given here will follow the construction in Fischer and Struik, 
since we will be diagonalizing against the set of computable sequences, which is 
countable. Thus, we will use the more straightforward construction. 

Theorem 4.7. There exists an inescapable group. 

Proof. Let $d \ge 2$ and $A = \{x_1, \ldots, x_d\}$. Consider the algebra, $\mathcal{A} = \mathbb{F}_2\langle A \rangle$ in
non-commuting indeterminates, $x_i \in A$. Let

\[ S = \bigcup_{i=1}^{d} \{(1 + x_i), (1 + x_i)^{15}\} \]

and enumerate all c.e. sequences over $S$: $s_0, s_1, \ldots$. We will use the elements of $S$
interchangeably as characters in an alphabet and as polynomials in $\mathcal{A}$.

We will construct a set of homogeneous polynomials, $F$, such that the following
conditions hold:

• $x_i^{16} \in F$ for $1 \le i \le d$. (This will ensure that $(1 + x_i)$ has order 16 and is
therefore multiplicatively invertible in the quotient algebra.)

• For each $i \in \mathbb{N}$, $s_i$ has a subword, $w$, such that

\[ \prod_{k=0}^{|w|-1} w(k) - 1 \]

is in the ideal generated by elements of $F$.

• The number of elements of $F$ of degree $i$ is 0 for $i < 16$, $d$ for $i = 16$, and
either 0 or 1 for every $i > 16$.

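Define the following sequence recursively:

\[ r_0 = 16 \]

\[ r_{n+1} = 15 \cdot R\left(2, 3, 2^{\frac{(2d)^{r_n+1}-1}{2d-1}-1}\right) \]

Here the exponent is the number of non-empty words of length at most $r_n$ over the
$2d$-letter alphabet $S$, so that $r_{n+1} = 15 \cdot C(2d, r_n)$ in the notation of Lemma 4.2.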

We can now enumerate the elements of $F$ as follows.

Start with $F = \{x_1^{16}, \ldots, x_d^{16}\}$ and begin enumerating the elements of all $s_i$ in
parallel. If, at any point, we have enumerated a contiguous subsequence of elements
of some $s_i$ of length $r_{i+1}$, call it $v$ and do the following.

By Lemma 4.2 we can find a non-empty subword, $w$, of $v$ such that $|w| \le \frac{1}{15} r_{i+1}$
and every sequence of length $\le r_i$ occurs an even number of times in $w$. Then, by
multiplying out,

\[ p(x_1, \ldots, x_d) := \prod_{k=0}^{|w|-1} w(k) = \sum_{w' \in S^*} \#(w,w') \prod_{k=0}^{|w'|-1} (w'(k) - 1) \]

Note that if $0 < |w'| \le r_i$, $\#(w,w') = 0 \bmod 2$ and that the constant term of
$p$ is 1. Note also that $w'(k) - 1$ always has constant term 0, so $p - 1$ has no
non-zero terms with degree $\le r_i$. Since $p$ is a polynomial, we can write $p - 1$
as a sum of homogeneous components, $f_1, \ldots, f_m$. Note that for all $1 \le j \le m$,
$r_i < d(f_j) \le 15|w| \le r_{i+1}$. Add all of these to $F$, stop enumerating $s_i$ but continue
enumerating everything else that hasn't been similarly halted.
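
The effect of the parity condition on the low-degree terms of $p - 1$ can be seen on a toy example. The sketch below is a simplification made for illustration: it works over $\mathbb{F}_2$ in non-commuting variables but only uses generators of the form $(1 + x)$, not $(1 + x)^{15}$, and all names are our own.

```python
def multiply(p, q):
    """Product of two polynomials over F_2 in non-commuting variables.

    A polynomial is a set of monomials, a monomial is a tuple of variable
    names, and the empty tuple () is the constant 1. Coefficients are mod 2,
    so a monomial produced twice cancels (symmetric difference).
    """
    result = set()
    for a in p:
        for b in q:
            result ^= {a + b}
    return result


def product_of_generators(word):
    """Multiply out (1 + x) for each variable name x in `word`, left to right."""
    p = {()}
    for x in word:
        p = multiply(p, {(), (x,)})
    return p


def lowest_degree_of_p_minus_one(word):
    """Smallest degree of a non-zero term of p - 1, or None if p = 1."""
    p_minus_one = product_of_generators(word) ^ {()}
    return min(len(m) for m in p_minus_one) if p_minus_one else None


print(lowest_degree_of_p_minus_one("xyxy"))  # 2
print(lowest_degree_of_p_minus_one("xy"))    # 1
```

In the word `xyxy` each generator occurs twice, so the degree-1 terms of the product cancel modulo 2 and $p - 1$ starts in degree 2; in `xy` each occurs once and the linear terms survive.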

The $F$ that is so constructed clearly contains $x_i^{16}$ for $1 \le i \le d$ and for each $s_i$, if
$s_i$ is total, it has a subword whose corresponding product is equivalent to 1 mod the
ideal generated by $F$. Also, each $s_i$ only adds polynomials to $F$ that have degree in
the interval $(r_i, r_{i+1}]$ and adds at most one polynomial of any given degree. Since
we start with $d$ polynomials of degree 16 and $r_0 = 16$, the number of elements of
$F$ of degree $i$ must be 0 for all $i < 16$ and either 0 or 1 for every $i > 16$.

We would then like to show that the multiplicative semigroup of the quotient algebra
$\mathcal{A}/(F)$ generated by elements of $S$, call it $G$, is an inescapable group. First, note
that by the binomial theorem,

\[ (1 + x_i)(1 + x_i)^{15} = 1 + x_i^{16} = 1 \]

so $G$ is a genuine group and $S$ is closed under inverses. $S$ trivially generates $G$, so
we only need $G$ to be infinite for $(G, S)$ to be a tape graph.
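
The binomial identity relies on every middle binomial coefficient of 16 being even, so that $(1 + x_i)^{16} = 1 + x_i^{16}$ over $\mathbb{F}_2$. A quick check, given purely as an illustration:

```python
from math import comb

# exponents k for which the coefficient of x^k in (1 + x)^16 is odd
odd_terms = [k for k in range(17) if comb(16, k) % 2 == 1]
print(odd_terms)  # [0, 16]

# by contrast, every coefficient of (1 + x)^15 is odd,
# so (1 + x)^15 = 1 + x + ... + x^15 over F_2
print(all(comb(15, k) % 2 == 1 for k in range(16)))  # True
```

Since only the single variable $x_i$ is involved, its powers commute and the ordinary binomial theorem applies even though the ambient algebra is non-commutative.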

By elementary calculus,

\[ 2 \le d \le \frac{1}{16}(d - .5)^{i-2} \]

for all $i \ge 11$. Therefore, $F$ satisfies the hypotheses of Corollary 4.6 and the
quotient algebra $\mathcal{A}/(F)$ is infinite-dimensional.

Now, for any positive integer, $n$, there must be a monomial of degree $n$ that does
not lie in the ideal generated by $F$. Otherwise, every monomial of degree $\ge n$ would
be in the ideal and the quotient algebra would be finite-dimensional. So, consider
two generic such monomials,

\[ x_{i_1} x_{i_2} \cdots x_{i_M} \quad \text{and} \quad x_{j_1} x_{j_2} \cdots x_{j_N} \]

with $M > N$ and the group elements,

\[ u = (1 + x_{i_1})(1 + x_{i_2}) \cdots (1 + x_{i_M}) \quad \text{and} \quad v = (1 + x_{j_1})(1 + x_{j_2}) \cdots (1 + x_{j_N}) \]

Then,

\[ u - v = x_{i_1} x_{i_2} \cdots x_{i_M} + \cdots \]

where the remaining terms all have degree $< M$. Since the ideal $(F)$ is generated
only by homogeneous polynomials, $u - v \in (F)$ if and only if every homogeneous
component of $u - v$ is in $(F)$. However, the degree $M$ component of $u - v$ is clearly
not in $(F)$, so neither is $u - v$. Therefore, $u$ and $v$ are different elements in $G$.

Thus, we can find infinitely many distinct elements of G, one for each degree. 
Therefore, G is infinite and (G, S) is a tape graph. 

In addition, any computable sequence of generators is one of the Si, so we have 
ensured that it has a subword such that the corresponding product is equal to 1 
in the quotient algebra. This is the same as having product 1 in the group, so all 
computable sequences of generators must correspond to self-intersecting paths. □

The construction given above in fact does better than producing an inescapable 
group. Since every step of the construction can be done recursively, the set of 
relations in the group is r.e. It is a standard result that a group with an r.e. 
set of relations is recursively presentable, so the construction produces a recursively 
presentable inescapable group. That being said, the question of whether there exists 
a finitely presentable inescapable group remains open. It is also unlikely that the 
word problem for the group constructed above is solvable, so there also remains the 
question of whether there exists an inescapable group with solvable word problem.
It should also be noted that the only property of computable sequences that the 
construction used was that there are countably many of them. Thus, the argument 
relativizes. In particular, the construction will produce a group with no escape in 
Turing degree T, but with a presentation in T. 

Since the presentation is in T, the set of escapes is $\Pi_1$ in T. You can see this
by observing that if an infinite sequence has a self-intersection, we will eventually 
know about it. Trivially, there are escapes, so by the low basis theorem, there must 
be a low escape. So, for any Turing degree, we have a group with no escapes in or 
below the given degree, but with an escape low relative to it. 

4.2. Inescapable Groups are Burnside Groups. It is clear that in an in- 
escapable group, all generators must have finite order, but must every group element 
have finite order? An infinite group in which every element has finite order is called 
a Burnside group and the existence of Burnside groups was an open problem for 
some time before E. Golod constructed one in 1964 [6, 5].

So our question can be rephrased as, must every inescapable group be a Burnside 
group? It turns out the answer is yes, but it's not as obvious as it appears. Suppose 
your group did have an element of infinite order. The obvious thing to do would be 
to write this element of infinite order as a product of generators and simply repeat 
that sequence to produce an escape. This certainly gives a computable sequence of 
generators that hits infinitely many elements of the group, but there is no reason 
this is a non-self-intersecting path. Fortunately, we can find some other element of 
infinite order and an expression of it as a product of generators such that this naive 
construction does give a non-self-intersecting path. 
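
The failure of the naive construction is easy to see in a familiar group. The sketch below uses $\mathbb{Z}^2$ with the four unit steps as generators, which is our own choice of example (it is of course not an inescapable group); the function name is also ours.

```python
from itertools import cycle, islice


def first_self_intersection(word, steps):
    """Follow the periodic sequence of generators `word` in Z^2 for `steps` steps.

    Returns the index of the first step that lands on an already visited
    point (the starting point counts as visited), or None if there is none.
    """
    position = (0, 0)
    visited = {position}
    for index, (dx, dy) in enumerate(islice(cycle(word), steps)):
        position = (position[0] + dx, position[1] + dy)
        if position in visited:
            return index
        visited.add(position)
    return None


E, N, W, S = (1, 0), (0, 1), (-1, 0), (0, -1)
looping = [E, N, W, S, E]   # the product is E, of infinite order, but the walk closes a loop
staircase = [E, N]          # the product is E + N, and the walk never revisits a point

print(first_self_intersection(looping, 100))    # 3
print(first_self_intersection(staircase, 100))  # None
```

Both words represent elements of infinite order, yet repeating the first one gives a self-intersecting path while repeating the second does not; this is the distinction between an element and a particular expression for it.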

The general idea is to start with the obvious construction and cut out the loops. 
Since our original element has infinite order, the constructed walk can only return 
to a given point a fixed, finite number of times. So, we wait at each point of the 
walk until it returns to our current position for the last time, then we follow for 
one step, and repeat. The only difficulty is knowing when the last return will be. 
Fortunately, this is invariant under shifting by our infinite order element, so we can 
just record it in a finite table indexed by the generators in the expression of our 
infinite order element. 

Theorem 4.8. Let $G = \langle g_0, \ldots, g_n \rangle$ be a group and $a \in G$ have infinite order.
Then there exists $b = \prod_{j=0}^{k-1} g_{i_j} \in G$ such that $\prod_{j=0}^{N} g_{i_{j \bmod k}}$ are distinct for each $N$.

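Proof. Let $a = \prod_{i=0}^{m-1} h_i$ with $h_i \in \{g_0, \ldots, g_n\}$ be an expression for $a$ of minimal
length. Define

\[ \delta(r, s, M) = \left(\prod_{i=r+1}^{m-1} h_i\right) a^M \left(\prod_{j=0}^{s-1} h_j\right) \]

and consider relations of the form $\delta(r, s, M) = e$ with $M \ge 0$. Fixing $r$ and $s$, there
is at most one $M$ for which $\delta(r, s, M) = e$ since $a$ has infinite order. Similarly,
fixing $r$ and $M$, there is at most one $s$ for which $\delta(r, s, M) = e$ since we chose an
expression for $a$ of minimal length. This allows us to define the following functions,

\[ \alpha : [0, m-1] \to \mathcal{P}(\mathbb{N} \times [0, m-1]), \quad \beta : [0, m-1] \to \mathbb{N}, \quad \text{and} \quad \gamma : [0, m-1] \to [0, m-1] \]

by

\[ \alpha(r) = \{(M, s) \mid \delta(r, s, M) = e\} \]

\[ \beta(r) = \begin{cases} 0 & \text{if } \alpha(r) = \emptyset \\ \max_{(M,s) \in \alpha(r)} M & \text{otherwise} \end{cases} \]

\[ \gamma(r) = \begin{cases} s & \text{if } (\beta(r), s) \in \alpha(r) \\ (r + 1) \bmod m & \text{if } \alpha(r) = \emptyset \end{cases} \]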

Notice that $\alpha(r)$ is always a finite set since $s$ can take on at most finitely many
values and for each value, there is at most one $M$ such that $(M, s) \in \alpha(r)$. Therefore,
whenever $\alpha(r)$ is non-empty, $\beta(r)$ exists and is attained by exactly one element,
$(\beta(r), s')$ of $\alpha(r)$. By definition, $\gamma(r) = s'$, so $\gamma$ is well-defined.

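Lemma 4.9. For all $n$, there is a $k_n$ such that

\[ \prod_{i=1}^{n} h_{\gamma^{(i)}(m-1)} = a^{k_n} h_0 \cdots h_{\gamma^{(n)}(m-1)} \]

where the sequence $\{k_n\}$ is defined by $k_0 = -1$ and

\[ k_{n+1} = k_n + \begin{cases} 0 & \text{if } \alpha(\gamma^{(n)}(m-1)) = \emptyset \text{ and } \gamma^{(n)}(m-1) \ne m-1 \\ 1 & \text{if } \alpha(\gamma^{(n)}(m-1)) = \emptyset \text{ and } \gamma^{(n)}(m-1) = m-1 \\ \beta(\gamma^{(n)}(m-1)) + 1 & \text{otherwise} \end{cases} \]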

Proof. We proceed by induction. If $n = 0$, the product on the left side is empty, so
we can take $k_0 = -1$.

For the induction step, suppose the lemma holds for $n - 1$. Then we can apply
the induction hypothesis to reduce the problem to

\[ a^{k_{n-1}} h_0 \cdots h_{\gamma^{(n-1)}(m-1)} h_{\gamma^{(n)}(m-1)} = a^{k_n} h_0 \cdots h_{\gamma^{(n)}(m-1)} \]

or more simply,

\[ h_0 \cdots h_{\gamma^{(n-1)}(m-1)} = a^{k_n - k_{n-1}} h_0 \cdots h_{\gamma^{(n)}(m-1)-1} \]

If $\alpha(\gamma^{(n-1)}(m-1))$ is empty, then

\[ \gamma^{(n)}(m-1) = (\gamma^{(n-1)}(m-1) + 1) \bmod m \]

If $\gamma^{(n-1)}(m-1) = m-1$, then the product on the left is $a$ and the product on the
right is $a^{k_n - k_{n-1}}$, so take $k_n = k_{n-1} + 1$. Otherwise, take $k_n = k_{n-1}$.

On the other hand, if $\alpha(\gamma^{(n-1)}(m-1))$ is non-empty, $\gamma^{(n)}(m-1) = s$ for $(M, s) \in
\alpha(\gamma^{(n-1)}(m-1))$ where $M = \beta(\gamma^{(n-1)}(m-1))$. Since $(M, s) \in \alpha(\gamma^{(n-1)}(m-1))$,

\[ h_0 \cdots h_{\gamma^{(n-1)}(m-1)} = h_0 \cdots h_{\gamma^{(n-1)}(m-1)} h_{\gamma^{(n-1)}(m-1)+1} \cdots h_{m-1} a^M h_0 \cdots h_{s-1} = a^{M+1} h_0 \cdots h_{\gamma^{(n)}(m-1)-1} \]

so we can take $k_n = M + 1 + k_{n-1} = \beta(\gamma^{(n-1)}(m-1)) + 1 + k_{n-1}$. □



Lemma 4.10. The sequence $h_{\gamma(m-1)}, h_{\gamma^{(2)}(m-1)}, h_{\gamma^{(3)}(m-1)}, \ldots$ corresponds to a
non-self-intersecting path.
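Proof. Suppose $\prod_{i=1}^{l_1} h_{\gamma^{(i)}(m-1)} = \prod_{i=1}^{l_2} h_{\gamma^{(i)}(m-1)}$ with $l_2 > l_1$. Then, by Lemma 4.9,

\[ a^{k_{l_1}} h_0 \cdots h_{\gamma^{(l_1)}(m-1)} = a^{k_{l_2}} h_0 \cdots h_{\gamma^{(l_2)}(m-1)} \]

or,

\[ h_0 \cdots h_{\gamma^{(l_1)}(m-1)} = a^{k_{l_2} - k_{l_1}} h_0 \cdots h_{\gamma^{(l_2)}(m-1)} \]

Then there are two cases.

First, if $k_{l_2} = k_{l_1}$, then $\gamma^{(l_1)}(m-1) = \gamma^{(l_2)}(m-1)$ by the minimality of the
expression for $a$. In addition, for all $i$ with $l_1 \le i < l_2$, $\alpha(\gamma^{(i)}(m-1)) = \emptyset$ and
$\gamma^{(i)}(m-1) \ne m-1$. Therefore, for all such $i$,

\[ \gamma^{(i+1)}(m-1) = \gamma^{(i)}(m-1) + 1 \bmod m \]

Since $\gamma^{(l_1)}(m-1) = \gamma^{(l_2)}(m-1)$, $l_2 - l_1 \ge m$. But then for some $l_1 \le j < l_2$,
$\gamma^{(j)}(m-1) = m-1$, which is a contradiction.

Second, if $k_{l_2} > k_{l_1}$, then $(k_{l_2} - k_{l_1} - 1, \gamma^{(l_2)}(m-1) + 1) \in \alpha(\gamma^{(l_1)}(m-1))$. Then
$k_{l_1+1} \ge k_{l_1} + (k_{l_2} - k_{l_1} - 1) + 1 = k_{l_2}$. Since $l_2 \ge l_1 + 1$ and the sequence of $k$'s
is non-decreasing, $l_2 = l_1 + 1$. We can then calculate $\gamma^{(l_1+1)}(m-1)$ directly. Note
that

\[ (\beta(\gamma^{(l_1)}(m-1)), \gamma^{(l_2)}(m-1) + 1) = (k_{l_1+1} - k_{l_1} - 1, \gamma^{(l_2)}(m-1) + 1) \]
\[ = (k_{l_2} - k_{l_1} - 1, \gamma^{(l_2)}(m-1) + 1) \in \alpha(\gamma^{(l_1)}(m-1)) \]

So

\[ \gamma^{(l_1+1)}(m-1) = \gamma^{(l_2)}(m-1) + 1 = \gamma^{(l_1+1)}(m-1) + 1 \]

which is again a contradiction. □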

Since $\gamma$ is a function from a finite set to itself, the sequence,

\[ \gamma(m-1), \gamma(\gamma(m-1)), \gamma^{(3)}(m-1), \ldots \]

is eventually periodic. Let $\gamma^{(l_1)}(m-1), \ldots, \gamma^{(l_2)}(m-1)$ be a single period and take
$b = \prod_{i=l_1}^{l_2} h_{\gamma^{(i)}(m-1)}$. This expression of $b$ as a product of generators clearly fits
the theorem. □
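
Finding a single period of the iterates of a function on a finite set is a routine computation. The following sketch shows one way to do it; the lookup table standing in for $\gamma$ is made up for illustration and is not derived from any group.

```python
def orbit_period(gamma, start):
    """Split the sequence gamma(start), gamma(gamma(start)), ... into
    (preperiod, cycle), where `cycle` is one full period."""
    seen = {}       # value -> index of its first appearance in the sequence
    sequence = []
    current = gamma(start)
    while current not in seen:
        seen[current] = len(sequence)
        sequence.append(current)
        current = gamma(current)
    first = seen[current]
    return sequence[:first], sequence[first:]


# hypothetical gamma on [0, 4] with m = 5, given as a lookup table
table = {0: 1, 1: 2, 2: 3, 3: 1, 4: 0}
print(orbit_period(table.__getitem__, 4))  # ([0], [1, 2, 3])
```

The loop terminates after at most $m$ steps because the domain has only $m$ elements, which is the finiteness used in the text.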

Corollary 4.11. Inescapable groups are Burnside groups.

Proof. Suppose $(G, S)$ is a tape graph and $G$ is not a Burnside group. Then $G$ has
an element of infinite order. By Theorem 4.8 we can find some $i_0, \ldots, i_{k-1}$ such
that

\[ \prod_{j=0}^{N} g_{i_{j \bmod k}} \]

is distinct for each $N$. The sequence $\{g_{i_{j \bmod k}}\}_{j \in \mathbb{N}}$ is computable (in fact, it's
regular), so $G$ is not inescapable. □
4.3. Further Remarks. As was previously mentioned, generalizing to multiple
tapes and multiple heads in this context is done in the same way as in the context 
of standard Turing machines. For example, a one-tape machine with multiple heads 
can be simulated by a one-tape machine with one head. The locations of all the 
heads can simply be marked on the tape with special symbols and for each step 
of the simulated machine, the simulating machine can search through the entire 
tape for all the head symbols and then update the tape accordingly. Similarly, a 
machine with multiple, identical tapes can be simulated by a machine with a single 
tape of the same type. Simply enlarge the tape alphabet to tuples of tape symbols 
together with special symbols for the head on each tape and follow the procedure 
for multiple heads. 

The only major difference is in a machine with multiple, different tapes. In this 
case, the class of functions computable by the machine is the class of functions in 
or below the join of the r.e. degrees of the word problems of the tapes. Just as in 
Section 3.4, this machine is mutually simulatable with a standard Turing machine
with oracles for the word problem of each tape. In fact, this case subsumes the 
multiple-head, single-tape and multiple-head, multiple-tape cases. 